Update trtllm-gen fused moe routing kernel and add more kernels #1955

jiahanc · 2025-10-20T21:25:41Z

📌 Description

update the trtllm-gen fused moe headers
add new kernels for trtllm-gen fused moe
- for NvFp4, add tile 256
- for MxFp8 x MxFp4, add 128, 256
- for FP8 per-tensor, add 192, 256
- for FP8 block scale, add 128
update the logics of computeSelectedTileN
add tune_max_num_tokens to FP8 per-tensor and FP8 block scale

🔍 Related Issues

🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

I have installed pre-commit by running pip install pre-commit (or used your preferred method).
I have installed the hooks with pre-commit install.
I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

Tests have been added or updated as needed.
All tests are passing (unittest, etc.).

Reviewer Notes

coderabbitai · 2025-10-20T21:26:05Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Signed-off-by: jiahanc <[email protected]>

Signed-off-by: Siyuan Fu <[email protected]>

jiahanc force-pushed the updateTileNCalc branch 2 times, most recently from 9d9ad95 to 7cd156d Compare October 22, 2025 22:21

jiahanc mentioned this pull request Oct 28, 2025

feat: autotune tile_tokens_dim in trtllm-gen MOE #1980

Merged

5 tasks

jiahanc and others added 6 commits October 30, 2025 11:28

update TileN calculation

e48f282

Signed-off-by: jiahanc <[email protected]>

update trtllm-gen bmm headers

01b5a27

fix typo

b5aa244

minor fix

29c23cd

Add WAR

0e68514

update trtllm-gen to dd8b

a5f9585

Signed-off-by: Siyuan Fu <[email protected]>

IwakuraRein force-pushed the updateTileNCalc branch from b79b913 to a5f9585 Compare October 30, 2025 18:44

IwakuraRein added 2 commits October 30, 2025 16:51

add new tile token dim (WIP)

7995977

fix typo

e7ac015

Signed-off-by: Siyuan Fu <[email protected]>

IwakuraRein force-pushed the updateTileNCalc branch from f060ab9 to e7ac015 Compare October 31, 2025 18:13

IwakuraRein added 4 commits October 31, 2025 14:33

update

c833388

Signed-off-by: Siyuan Fu <[email protected]>

update

d20582f

Signed-off-by: Siyuan Fu <[email protected]>

bump trtllm-gen to ac83a

78004cf

Signed-off-by: Siyuan Fu <[email protected]>

public cubins

8935688

IwakuraRein changed the title ~~Revise the calculation related to TileN in routing of MOE TRTLLM backend~~ Update trtllm-gen fused moe routing kernel and add more kernels Nov 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update trtllm-gen fused moe routing kernel and add more kernels #1955

Update trtllm-gen fused moe routing kernel and add more kernels #1955

Uh oh!

jiahanc commented Oct 20, 2025 •

edited by IwakuraRein

Loading

Uh oh!

coderabbitai bot commented Oct 20, 2025 •

edited

Loading

Review skipped

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update trtllm-gen fused moe routing kernel and add more kernels #1955

Are you sure you want to change the base?

Update trtllm-gen fused moe routing kernel and add more kernels #1955

Uh oh!

Conversation

jiahanc commented Oct 20, 2025 • edited by IwakuraRein Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📌 Description

🔍 Related Issues

🚀 Pull Request Checklist

✅ Pre-commit Checks

🧪 Tests

Reviewer Notes

Uh oh!

coderabbitai bot commented Oct 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jiahanc commented Oct 20, 2025 •

edited by IwakuraRein

Loading

coderabbitai bot commented Oct 20, 2025 •

edited

Loading